HMM-neural network monophone models for computer-based articulation training for the hearing impaired
Authors
Abstract
A visual speech training aid for persons with hearing impairments has been developed using a Windows-based multimedia computer. Previous papers described the signal processing steps and display options for giving real-time feedback about the quality of pronunciation for 10 steady-state American English monophthong vowels (/aa/, /iy/, /uw/, /ae/, /er/, /ih/, /eh/, /ao/, /ah/, and /uh/). This vowel training aid is thus referred to as a Vowel Articulation Training Aid (VATA). The present paper describes methods for developing a monophone-based Hidden Markov Model/Neural Network recognizer so that real-time visual feedback can be given about the quality of pronunciation of short words and phrases. Experimental results are reported that indicate a high degree of accuracy for labeling and segmenting the CVC database developed for "training" the display.
Similar resources
Two Novel Learning Algorithms for CMAC Neural Network Based on Changeable Learning Rate
The Cerebellar Model Articulation Controller (CMAC) neural network is a computational model of the cerebellum that acts as a lookup table. The advantages of CMAC are fast learning convergence and the capability of mapping nonlinear functions, owing to its local generalization of weight updating, simple structure, and easy processing. In the training phase, the disadvantage of some CMAC models is unstable phenomenon...
Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM
Improving phoneme recognition has attracted the attention of many researchers because of its applications in various fields of speech processing. Recent research shows that using deep neural networks (DNNs) in speech recognition systems significantly improves their performance. DNN-based phoneme recognition systems have two phases: training and testing. Mos...
Discriminative and Maximum Likelihood Classifiers for Computer-Based Visual Feedback for Speech Training for the Hearing Impaired
A visual speech training aid for persons with hearing impairments has been developed using a Windows-based multimedia computer. The training aid provides real-time visual feedback as to the quality of pronunciation for 10 steady-state American English monophthong vowel phonemes (/aa/, /iy/, /uw/, /ae/, /er/, /ih/, /eh/, /ao/, /ah/, and /uh/). This training aid is thus referred to as a Vowel Arti...
Evaluation of the Hidden Markov Model for Detection of P300 in EEG Signals
Introduction: Evoked potentials elicited by stimulating the brain can be used as a communication tool between humans and machines. Most brain-computer interface (BCI) systems use the P300 component, which is an evoked potential. In this paper, we evaluate the use of the hidden Markov model (HMM) for detection of P300. Materials and Methods: The wavelet transforms, wavelet-enhanced indepen...
A Hybrid Optimization Algorithm for Learning Deep Models
Deep learning is a subset of machine learning that is widely used in artificial intelligence (AI) fields such as natural language processing and machine vision. The learning algorithms require optimization in multiple aspects. Generally, model-based inference requires solving an optimization problem. In deep learning, the most important problem that can be solved by optimization is neural n...